Libya
Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering
Large Language Models (LLMs) are widely used for knowledge-seeking purposes yet suffer from hallucinations. The knowledge boundary of an LLM limits its factual understanding, beyond which it may begin to hallucinate. Investigating the perception of LLMs' knowledge boundary is crucial for detecting hallucinations and LLMs' reliable generation. Current studies perceive LLMs' knowledge boundary on questions with concrete answers (close-ended questions) while paying limited attention to semi-open-ended questions that correspond to many potential answers. Some researchers achieve it by judging whether the question is answerable or not. However, this paradigm is not so suitable for semi-open-ended questions, which are usually "partially answerable questions" containing both answerable answers and ambiguous (unanswerable) answers.
Meta AI adds Arabic support for Middle East and North Africa
As large language models face growing criticism for their lack of language inclusivity beyond the English-dominated West, leading AI companies have started tailoring regional-specific LLMs to break this cycle. Now, Meta is riding that wave. Meta is expanding Meta AI across the Middle East and North Africa (aka, MENA) as it provides language support for millions of Arabic-speaking users in Algeria, Egypt, Iraq, Jordan, Libya, Morocco, Saudi Arabia, Tunisia, the United Arab Emirates (UAE), and Yemen. Similarly, Mistral AI recently released its first Arabic-centric language model, Saba, which is tailored to meet the needs of its growing customer base in Arabic-speaking countries. Meta AI, an AI-powered chatbot and virtual assistant based on Llama 3.2, is available on Facebook, Instagram, WhatsApp, and Messenger.
Grounding Continuous Representations in Geometry: Equivariant Neural Fields
Wessels, David R, Knigge, David M, Papa, Samuele, Valperga, Riccardo, Vadgama, Sharvaree, Gavves, Efstratios, Bekkers, Erik J
Recently, Neural Fields have emerged as a powerful modelling paradigm to represent continuous signals. In a conditional neural field, a field is represented by a latent variable that conditions the NeF, whose parametrisation is otherwise shared over an entire dataset. We propose Equivariant Neural Fields based on cross attention transformers, in which NeFs are conditioned on a geometric conditioning variable, a latent point cloud, that enables an equivariant decoding from latent to field. Our equivariant approach induces a steerability property by which both field and latent are grounded in geometry and amenable to transformation laws if the field transforms, the latent represents transforms accordingly and vice versa. Crucially, the equivariance relation ensures that the latent is capable of (1) representing geometric patterns faitfhully, allowing for geometric reasoning in latent space, (2) weightsharing over spatially similar patterns, allowing for efficient learning of datasets of fields. These main properties are validated using classification experiments and a verification of the capability of fitting entire datasets, in comparison to other non-equivariant NeF approaches. We further validate the potential of ENFs by demonstrate unique local field editing properties.
Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Lynn, Teresa, Altakrori, Malik H., Magdy, Samar Mohamed, Das, Rocktim Jyoti, Lyu, Chenyang, Nasr, Mohamed, Samih, Younes, Aji, Alham Fikri, Nakov, Preslav, Godbole, Shantanu, Roukos, Salim, Florian, Radu, Habash, Nizar
The rapid evolution of Natural Language Processing (NLP) has favored major languages such as English, leaving a significant gap for many others due to limited resources. This is especially evident in the context of data annotation, a task whose importance cannot be underestimated, but which is time-consuming and costly. Thus, any dataset for resource-poor languages is precious, in particular when it is task-specific. Here, we explore the feasibility of repurposing existing datasets for a new NLP task: we repurposed the Belebele dataset (Bandarkar et al., 2023), which was designed for multiple-choice question answering (MCQA), to enable extractive QA (EQA) in the style of machine reading comprehension. We present annotation guidelines and a parallel EQA dataset for English and Modern Standard Arabic (MSA). We also present QA evaluation results for several monolingual and cross-lingual QA pairs including English, MSA, and five Arabic dialects. Our aim is to enable others to adapt our approach for the 120+ other language variants in Belebele, many of which are deemed under-resourced. We also conduct a thorough analysis and share our insights from the process, which we hope will contribute to a deeper understanding of the challenges and the opportunities associated with task reformulation in NLP research.
Sandra -- A Neuro-Symbolic Reasoner Based On Descriptions And Situations
Lazzari, Nicolas, De Giorgis, Stefano, Gangemi, Aldo, Presutti, Valentina
This paper presents sandra, a neuro-symbolic reasoner combining vectorial representations with deductive reasoning. Sandra builds a vector space constrained by an ontology and performs reasoning over it. The geometric nature of the reasoner allows its combination with neural networks, bridging the gap with symbolic knowledge representations. Sandra is based on the Description and Situation (DnS) ontology design pattern, a formalization of frame semantics. Given a set of facts (a situation) it allows to infer all possible perspectives (descriptions) that can provide a plausible interpretation for it, even in presence of incomplete information. We prove that our method is correct with respect to the DnS model. We experiment with two different tasks and their standard benchmarks, demonstrating that, without increasing complexity, sandra (i) outperforms all the baselines (ii) provides interpretability in the classification process, and (iii) allows control over the vector space, which is designed a priori.
Adaptive Fusion of Multi-view Remote Sensing data for Optimal Sub-field Crop Yield Prediction
Mena, Francisco, Pathak, Deepak, Najjar, Hiba, Sanchez, Cristhian, Helber, Patrick, Bischke, Benjamin, Habelitz, Peter, Miranda, Miro, Siddamsetty, Jayanth, Nuske, Marlon, Charfuelan, Marcela, Arenas, Diego, Vollmer, Michaela, Dengel, Andreas
Accurate crop yield prediction is of utmost importance for informed decision-making in agriculture, aiding farmers, and industry stakeholders. However, this task is complex and depends on multiple factors, such as environmental conditions, soil properties, and management practices. Combining heterogeneous data views poses a fusion challenge, like identifying the view-specific contribution to the predictive task. We present a novel multi-view learning approach to predict crop yield for different crops (soybean, wheat, rapeseed) and regions (Argentina, Uruguay, and Germany). Our multi-view input data includes multi-spectral optical images from Sentinel-2 satellites and weather data as dynamic features during the crop growing season, complemented by static features like soil properties and topographic information. To effectively fuse the data, we introduce a Multi-view Gated Fusion (MVGF) model, comprising dedicated view-encoders and a Gated Unit (GU) module. The view-encoders handle the heterogeneity of data sources with varying temporal resolutions by learning a view-specific representation. These representations are adaptively fused via a weighted sum. The fusion weights are computed for each sample by the GU using a concatenation of the view-representations. The MVGF model is trained at sub-field level with 10 m resolution pixels. Our evaluations show that the MVGF outperforms conventional models on the same task, achieving the best results by incorporating all the data sources, unlike the usual fusion results in the literature. For Argentina, the MVGF model achieves an R2 value of 0.68 at sub-field yield prediction, while at field level evaluation (comparing field averages), it reaches around 0.80 across different countries. The GU module learned different weights based on the country and crop-type, aligning with the variable significance of each data source to the prediction task.
Deep Blind Super-Resolution for Satellite Video
Xiao, Yi, Yuan, Qiangqiang, Zhang, Qiang, Zhang, Liangpei
Recent efforts have witnessed remarkable progress in Satellite Video Super-Resolution (SVSR). However, most SVSR methods usually assume the degradation is fixed and known, e.g., bicubic downsampling, which makes them vulnerable in real-world scenes with multiple and unknown degradations. To alleviate this issue, blind SR has thus become a research hotspot. Nevertheless, existing approaches are mainly engaged in blur kernel estimation while losing sight of another critical aspect for VSR tasks: temporal compensation, especially compensating for blurry and smooth pixels with vital sharpness from severely degraded satellite videos. Therefore, this paper proposes a practical Blind SVSR algorithm (BSVSR) to explore more sharp cues by considering the pixel-wise blur levels in a coarse-to-fine manner. Specifically, we employed multi-scale deformable convolution to coarsely aggregate the temporal redundancy into adjacent frames by window-slid progressive fusion. Then the adjacent features are finely merged into mid-feature using deformable attention, which measures the blur levels of pixels and assigns more weights to the informative pixels, thus inspiring the representation of sharpness. Moreover, we devise a pyramid spatial transformation module to adjust the solution space of sharp mid-feature, resulting in flexible feature adaptation in multi-level domains. Quantitative and qualitative evaluations on both simulated and real-world satellite videos demonstrate that our BSVSR performs favorably against state-of-the-art non-blind and blind SR models. Code will be available at https://github.com/XY-boy/Blind-Satellite-VSR
Predicting Confinement Effect of Carbon Fiber Reinforced Polymers on Strength of Concrete using Metaheuristics-based Artificial Neural Networks
Wahab, Sarmed, Suleiman, Mohamed, Shabbir, Faisal, Mahmoudabadi, Nasim Shakouri, Waqas, Sarmad, Herl, Nouman, Ahmad, Afaq
Keywords: carbon fiber reinforced polymer, concrete, confinement effect, strength, particle swarm optimization, grey wolf optimizer, bat algorithm Abstract This article deals with the study of predicting the confinement effect of carbon fiber reinforced polymers (CFRPs) on concrete cylinder strength using metaheuristics-based artificial neural networks. Three metaheuristic models are implemented including particle swarm optimization (PSO), grey wolf optimizer (GWO), and bat algorithm (BA). These algorithms are trained on the data using an objective function of mean square error and their predicted results are validated against the experimental studies and finite element analysis. The study shows that the hybrid model of PSO predicted the strength of CFRP-confined concrete cylinders with maximum accuracy of 99.13% and GWO predicted the results with an accuracy of 98.17%. The high accuracy of axial compressive strength predictions demonstrated that these prediction models are a reliable solution to the empirical methods. The prediction models are especially suitable for avoiding full-scale time-consuming experimental tests that make the process quick and economical. 1 Introduction Fiber-reinforced polymer is a composite material comprising fibers of either glass, aramid, or carbon and a polymer matrix. These fibers improve the properties of the polymer matrix mechanically including its stiffness and strength. The popularity of these composites has increased significantly in civil engineering due to their ability to strengthen concrete structural members. FRPs can be used either as a bar or plates embedded in concrete as an internal reinforcement and can be used as an external reinforcement by wrapping FRP sheets to existing structural members. The FRP bars have significantly higher strength than the steel reinforcement bars. They are highly durable and resistant to chemicals, corrosion (Cousin et al. 2019, Ananthkumar et al. 2020, Zhang et al. 2020), and radiation, their higher strength-to-weight ratio (Zhou et al. 2019) makes them ideal for structures that require high strength but need not be heavy. They can be molded into any required shape that provides higher design flexibility. Moreover, it has a lower environmental impact (Lee and Jain 2009), unlike concrete and timber.
Evaluating The Accuracy of Classification Algorithms for Detecting Heart Disease Risk
Alariyibi, Alhaam, El-Jarai, Mohamed, Maatuk, Abdelsalam
The healthcare industry generates enormous amounts of complex clinical data that make the prediction of disease detection a complicated process. In medical informatics, making effective and efficient decisions is very important. Data Mining (DM) techniques are mainly used to identify and extract hidden patterns and interesting knowledge to diagnose and predict diseases in medical datasets. Nowadays, heart disease is considered one of the most important problems in the healthcare field. Therefore, early diagnosis leads to a reduction in deaths. DM techniques have proven highly effective for predicting and diagnosing heart diseases. This work utilizes the classification algorithms with a medical dataset of heart disease; namely, J48, Random Forest, and Na\"ive Bayes to discover the accuracy of their performance. We also examine the impact of the feature selection method. A comparative and analysis study was performed to determine the best technique using Waikato Environment for Knowledge Analysis (Weka) software, version 3.8.6. The performance of the utilized algorithms was evaluated using standard metrics such as accuracy, sensitivity and specificity. The importance of using classification techniques for heart disease diagnosis has been highlighted. We also reduced the number of attributes in the dataset, which showed a significant improvement in prediction accuracy. The results indicate that the best algorithm for predicting heart disease was Random Forest with an accuracy of 99.24%.
Niger coup: Will the West change its security approach to the Sahel?
Until the recent military takeover in July 2023, Niger had played a key role in the security architecture of the West, particularly the United States and France, in the Sahel region. Niger hosts US and French military bases, while international support in different fields has increased exponentially in recent years. For example, take the 500 million euros ($546m) provided by the European Union in 2021, 120 million euros ($131m) of aid from France in 2022 or $150m of direct aid announced during US Secretary of State Antony Blinken's visit to Niamey in March 2023. This is one of the reasons why Niger had a relatively secure environment that did not allow violent armed activities to a large extent. Even though casualties from "terrorist attacks" increased worldwide after 2021, the loss of civilians in Niger decreased by 80 percent in 2022.